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METAL BINDING PROTEINS, RECOMBINANT HOST CELLS 

AND METHODS 
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BACKGROUND OF THE INVENTION 
The field of the present invention is the area of molecular biology, in particular as 
related to a metal-binding protein produced through recombinant DNA technology. 

One of the best-characterized mercury resistance {mer) operons is located on transposon 
Tn27 from the Shigella flexneri IncFII plasmid RIOO. This operon consists of five structural 
genes - merT, merP, merC, mer A, and merD - and a regulatory gene, merR (Fig. 1). MerT, 
MerP, and MerC are involved in the transport of Hg(II) into the cell. Mer A is a cytosolic, 
NADPH-dependent, flavin adenine dinucleotide-containing oxidoreductase which reduces 
Hg(II) to Hg(0). The merR gene is transcribed in the direction opposite from that of the 
structural genes. Its product, MerR, represses expression of the merTPCAD genes in the 
absence of Hg(II), activates their expression in the presence of Hg(II), and represses its own 
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expression in the presence or absence of Hg(II). MerD plays a minor role in regulation, 
possibly as an antagonist of MerR, which reestablishes repression of merTPCAD once Hg(II) 
has been reduced to Hg(0). 

Extensive genetic and biochemical data indicate that the protein contains three domains 
[Rossetal. (1989)7. BacterioL 171:4009-4018; Shewchuket al. (1989) 28: 2340- 
2344; Summers, A.O. (1992)7. BacterioL 174:3097-3101]: a helix-turn-helix DNA-binding 
domain from LIO to R29 [Brennan et al.(1989) 7. BioL Chem. 264:1903-1906; Livrelli et al. 
(1993) 7. BioL Chem. 268:2623-2631]; a "coupling" domain from K30 to H81 which may 
convey the status of the Hg(II) binding site to the DNA binding site [Comess et al. (1994) 
Biochemistry 33:4175-4186; Helmann et al. (1990) Science 247:946-948]; and a long helical 
region from C82 to CI 17 which constitutes both the dimer interface and, with the loop 
containing C126, the Hg(II)-binding domain [Zeng et al. (1998) Biochemistry 37:15885- 
15895]. 

The MerR homodimer binds Hg(II) by using the thiols of three conserved cysteines: 
cysteine 82 (C82) from one monomer and cysteines 117 and 126 (CI 17 and C126) from the 
other monomer. These ligands form a novel planar tricoordinate complex with Hg(II) [Helman 
et al. (1990) supra; O'Halloran, T.V. (1993) Science 261:715-725; Utschig et al. (1995) 
Science 26 3%0-3%5\ Wright et al. (1990) 7. .4m. C/^^m. 5^?c. 112:2434-2435]. Upon binding 
Hg(II), MerR undergoes a conformational change that leads to an underwinding of the 
region and thereby enables RNA polymerase to form an open complex. Curiously, although 
the MerR homodimer contains two potential Hg(II) binding sites, the purified protein binds 
only one Hg(II) per dimer. Moreover, although the two other group 12 metals, Zn(II) and 
Cd(II), also form stable complexes with protein thiols [Boulanger et al. (1983) Proc, NatL 
Acad. Sci, USA 80:1501-1505; Furey et al. (1986) Science 231:704-708; Santos et al. (1991) 
7. Am. Chem. Soc. 113:469-474; Utschig etal. (1993) Methods EnzymoL 226:71-97], purified 
MerR binds Hg(II) preferentially even in the presence of a 1 ,000-fold excess of Cd(II) or 
Zn(II) [Shewchuk et al. (1989) Biochemistry 28:2331-2339] and also requires 100- to 1,000- 
fold higher concentrations of these metals for transcriptional activation [Ralston et al. (1990) 
Proc. NatL Acad, Sci. USA 87:3846-3850. 



Although several studies have described mutants altered in either repression or 
activation, no genetic study of merR has explicitly addressed the basis of its metal specificity 
prior to Caguiat et al. (1999) /. Bacterial, 181:3462-3471. Caguiat et al. report variants of 
MerR with an altered response to a metal; the properties and locations of these mutations shed 
light on the basis for metal-provoked activation by MerR. That also led those authors to 
examine the possible occurrence of similar secondary-structure elements in other members of 
the MerR family, and our findings in the latter regard indicate conservation of a structural 
domain in a subset of this family . 



There is a long felt need in the art for compositions and methods useful in the 
remediation of materials contaminated with heavy metal ions, especially mercury and 
cadmium. The present invention provides artificial metal binding proteins, coding sequences, 
recombinant DNA molecules, recombinant host cells and methods for production of the 
artificial metal binding proteins. 



SUMMARY OF THE INVENTION 
The present invention provides DNA sequences encoding artificial metal binding 
proteins, which are termed chelons herein, and DNA sequences encoding the mercury binding 
protein MerR (derived from Tn21), vectors comprising those sequences operably linked to 
promoters functional in particular host cells of choice, recombinant host cells expressing the 
chelon proteins or MerR protein and methods of producing the chelon proteins and MerR 
protein recombinantly. Host cells include, but are not limited to, bacterial cells (e.g., 
Escherichia coli. Bacillus subtilis, Pseudomonas species, etc.), yeast cells (e.g., 
Saccharomyces cerevisiae or Pichia pastoris), fungi (e.g., Aspergillus species, Trichoderma 
reesei), plant cells (e.g., Arabidopsis, tobacco, petunia, Populus species, Salix species, among 
others) and animal cells such as CHO cells or avian cells. 

^ i^e present invention further provides chelon proteins which bind heavy metal ions, 
including but not limited to divalent cadmium and mercury ions, with relatively high affinity. 
The amino acid sequence of a mercury-specific chelon is disclosed in Table ID, and the 
nucleotide sequence encoding it is present in Table IC. Additional chelons which bind both 



iftercury and cadmium ions with high affinity are disclosed in Table 2. The chelon proteins 
canNbe within cells or on the surfaces of the cells in which they were produced, for example, 
for u\e in methods of concentrating heavy metal ions from a contaminated aqueous 
environVnt, or waste stream, or the chelon proteins can be immobilized onto a solid support 
for use inVemoval of heavy metal ions from a contaminated aqueous medium. The MerR 
protein canVlso be produced for in situ metal ion binding, or it can be purified and 
immobilized tAa support material. 

Also within the scope of the present invention are methods of removing heavy metal 
ions from contaminated aqueous solutions, waste streams or contaminated environments, for 
example, using immobilized chelon or MerR proteins, immobilized cells containing the MerR 
or chelon proteins or using whole plants to take up, sequester and concentrate the heavy metal 
ions from contaminated soil, ground water, hydroponic solutions, irrigation water or waste 
streams. Selective expression of the MerR or chelon proteins in above-ground plant parts 
(such as stems and leaves) allows the harvesting of the plant material in which the heavy metal 
ions have been sequestered and concentrated. Those plant parts can then be disposed of or 
processed in an environmentally appropriate manner. 



BRIEF DESCRIPTION OF THE DRAWINGS 
20 ^-p^jl^ures lA-lC illustrate the full length wild-type MerR protein, as the monomer ic 
metal bindmg domain, the dimeric coiled-coil metal binding domain and the mercury-bound 
dimeric metal^inding domain, respectively. In each, the tube represents the coiled-coil region, 
the small open Stoheres represent cysteine residues, the hatched curve is the short loop regions, 
the large cross-hatched circles represent all other regions of the protein, and the small black 
25 sphere represents ^S^nd mercuric ion. Dimer in solution binds one Hg ion because of the 
conformational chan^ Figure IB illustrates the chelon protein with its tandem metal binding 
domains. The shapes amas in Figure 1 A except that the hatched curves represent the nonwild- 
type linker residues whicK\allow association of the coiled-coils and stabilize the protein. 



Figure 2 graphically illustrates the results of Hg(II) binding as measured by equilibrium 
ultrafiltration. The capacity of the chelon appears to be higher than that of the naturally 
occurring MerR protein. 

5 Figure 3 provides the results of Matrix Assisted Laser Desorption Ionization Time of 

Flight (MALDI-TOF, using a Bruker Reflex Instrument) of a purified mercury-specific chelon 
protein as eluted from a StrepTag column. Peaks shown are the single positive charge at 
12,817.7 Da and the double positively charged species at 6412.0 Da. Both peaks are in 
reasonable agreement with the expected values for the chelon protein (12821 .58 Da). Tailing 
10 on major peaks arises from random adducts of sinapinic acid from the matrix (224 Da). 

5 Figure 4 summarized steps in the cloning of the chelon coding sequence into the p ASK- 

y:i IB A3 vector. In PGR reaction 1 (with primers corresponding to SEQ ID NO: 13 and SEQ ID 

y NO: 14) and PGR reaction 2 (with primers corresponding to SEQ ID NO: 15 and SEQ ID 

j'15 NO: 16), each with pNH9 as template, the chelon coding sequence DNA fragment was 
■4 synthesized. The two PGR products were digested with Bsal and BamHI and ligated into 

pASK IB A3 (Genosys) which had been cut with Bsal. The ligation products were transformed 
J: into Escherichia coli XLl-Blue and transformants were selected using ampicillin. In the 

resulting recombinant vector (pASK MBD, carrying the chelon of SEQ ID NO: 4), chelon 
m20 expression is induced by anhydrotetracycline. Tetracycline can also be used to induce 

expression. PNH9 has been described in Hamlett et al. (1992) 7. Bacterial, 174:6377-6385. 

Figure 5 shows binding of ^^^Hg ions by cells expressing MerR or chelon protein (SEQ 
ID NO: 2); the data were collected in two experiments. 

25 

DETAILED DESGRIPTION OF THE INVENTION 
In order to provide a clear and consistent understanding of the specification and claims, 
including the scope to be given to such terms, the following definitions are provided. 

30 A coding sequence is the part of a gene or nucleic acid molecule which codes for the 

amino acid sequence of a protein, or for a functional RNA such as a tRNA or rRNA. 

5 



Complement or complementary sequence means a sequence of nucleotides which forms 
a hydrogen-bonded duplex with another sequence of nucleotides according to Watson-Crick 
base-pairing rules. For example, the complementary base sequence for 5'-AAGGCT-3' is 3'- 
TTCCGA-5'. 

5 

Downstream of means on the 3' side of any site in DNA or RNA, and upstream of 
means on the 5' side of a site in DNA or RNA. 

Expression refers to the transcription of a coding sequence into structural RNA (rRNA, 
10 tRNA) or messenger RNA (mRNA) and subsequent translation of a mRNA into a protein. 

An amino acid sequence that is functionally equivalent to a chelon or MerR protein of 
jD the present invention is an amino acid sequence that has been modified by single or multiple 

y amino acid substitutions, by addition and/or deletion of amino acids, or where one or more 

j J5 amino acids have been chemically modified, but which nevertheless retains the heavy metal 
binding activity of a chelon (or MerR protein) of the present invention. Functionally 
M equivalent nucleotide sequences are those that encode polypeptides having substantially the 

u same biological activity. 

="'20 Two nucleic acid sequences are heterologous to one another if the sequences are derived 

from separate organisms, whether or not such organisms are of different species, as long as 
the sequences do not naturally occur together in the same arrangement in the same organism. 

Homology refers to the extent of identity between two nucleotide or amino acid 
25 sequences. 

Isolated means altered by the hand of man from the natural state. If an isolated 
composition or substance occurs in nature, it has been changed or removed from its original 
environment, or both. For example, a polynucleotide or a polypeptide naturally present in a 
30 living animal is not isolated, but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is isolated, as the term is employed herein. 
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A linker region is an amino acid sequence that operably links two functional or 
structural domains of a protein, for example, between two metal binding domains of a chelon. 



A nucleic acid construct is a nucleic acid molecule which is isolated from a naturally 
5 occurring gene or which has been modified to contain segments of nucleic acid which are 
combined and juxtaposed in a manner which would not otherwise exist in nature. 

Nucleic acid molecule means a single- or double-stranded linear polynucleotide 
containing either deoxyribonucleotides or ribonucleotides that are linked by 3'-5'- 
10 phosphodiester bonds. 

y3 Two DNA sequences are operably linked if the nature of the linkage does not interfere 

S| with the ability of the sequences to effect their normal functions relative to each other. For 

=JJ instance, a promoter region is operably linked to a coding sequence if the promoter is capable 

W5 of effecting transcription of that coding sequence in a particular host cell of choice. 

□ A polypeptide is a linear polymer of amino acids linked by peptide bonds. 

□ A promoter is a cis-acting DNA sequence, generally 80-120 base pairs long and located 
20 upstream of the initiation site of a gene, to which RNA polymerase may bind and initiate 

correct transcription. There can be associated additional transcription regulatory sequences 
which provide on/off regulation of transcription and/or which enhance (increase) expression 
of the downstream coding sequence. 

25 A recombinant nucleic acid molecule, for instance a recombinant DNA molecule, is a 

novel nucleic acid sequence formed in vitro through the ligation of two or more 
nonhomologous DNA molecules (for example a recombinant plasmid containing one or more 
inserts of foreign DNA cloned into its cloning site or its poly linker). 

30 Transformation means the directed modification of the genome of a cell by the external 

application of purified recombinant DNA from another cell of different genotype, leading to 
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its uptake and integration into the subject cell's genome. In bacteria, the recombinant DNA 
is not typically integrated into the bacterial chromosome, but instead replicates autonomously 
as a plasmid. 

5 A vector is a nucleic acid molecule that is able to replicate autonomously in a host cell 

and can accept foreign DNA. A vector carries its own origin of replication, one or more 
unique recognition sites for restriction endonucleases which can be used for the insertion of 
foreign DNA, and usually selectable markers such as genes coding for antibiotic resistance, 
and often recognition sequences (e.g. promoter) for the expression of the inserted DNA. 
10 Connmon vectors include plasmid vectors and phage vectors. 



y The chelon specifically embodied by the present invention is an artificial protein 

yj comprising at least two metal binding domains derived from the MerR protein of the Tn21 mer 

y (mercury resistance) operon. In the presence of mercuric ions, the MerR protein binds 

5"'7 5 mercuric ions and serves as a positive regulator of the mercury resistance operon. An isolated 
metal binding domain polypeptide has been described [Zeng et al. (1998) Biochemistry 

M 37:15885-15895]. See also Caguiat et al. (1990 7, Bacterial. 181:3462-3471, for further 

n discussion of heavy metal binding by the MerR protein and cadmium-binding variants. 

h-20 present work provides the first demonstration that an independent sub-domain of 

a naturallytoccurring protein can be incorporated in a heavy metal sequestration protein which 
binds cognareunetal ions with high specificity and high affinity. Previously described metal 
binding protein^are those which bind divalent "beneficial" metal ions, e.g. , nickel, copper and 
zinc. Generally fbr those proteins, the affinities and specificities for heavy metals are lower, 

25 although some forrmof metallothionein bind divalent cadmium ions. The previously known 
metal binding proteins are subject to having the metal ligand competed off by naturally 
occurring thiols such ^ glutathione and cysteine. The relatively low affinities reflect the 
physiological roles of the^ proteins in serving as chaperones for the metal ions en route to the 
active sites of enzymes or structural sites of DNA binding proteins. The intact MerR protein 

30 and its metal binding domain have much higher affinity (about 9 orders of magnitude) than 
natural thiols such as glutattiione or cysteine and nonphysiological thiols such as 2- 
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mercaptoMianol [Zeng et al. (1998) supra]. The wild-type MerR protein binds mercuric ion 
with an affid^y of 10'^ M even in the presence of millimolar quantities of thiols, and it binds 
divalent cadniilm^ and divalent zinc ions a hundred-fold and thousand-fold less well. 

5 In the chelons of the present invention, the metal binding site requires that an 

antiparallel association of sequence motifs come together in a way that mimics the natural 
mercuric ion binding site of MerR dimers. The ability to bind metal ions by the chelon is 
documented in Figure 2. 

10 Bacterial cells over-expressing a chelon (MBD) or MerR protein intracellularly bind 

more Hg than a strain carrying only the plasmid vector (Fig. 5). Western analysis indicates 
that the MerR and chelon proteins are produced in similar amounts. However, MerR binds 

j^-^ Hg ions somewhat better than does the chelon protein. Without wishing to be bound by any 

particular theory, it is believed that this may arise because of differences in folding stability 

h;15 for the very short, artificial chelon compared to the full-length MerR protein inside the cell. 
Nonetheless, chelon protein expression clearly increases the amount of Hg bound. 

h.^ When the chelon and MerR proteins are each attached to a solid substrate (Streptactin 

Sepharose) in vitro via StrepTag streptavidin/Streptactin sequences (Genosys) both bind the 

^'20 solid substrate with equal efficiency, i.e. with very similar affinity (in terms of motes of 
protein bound to the resin), but the bound chelon is more effective than bound MerR protein 
in binding Hg ions (Table 3). Although the Hg occupancy of the chelon is only 50%, its 
proficiency in this regard compared to MerR (which binds Hg with ca. 3-fold less efficiency) 
is expected because each tethered copy of the chelon contains a complete Hg binding site. 
25 However, for MerR, the tethered monomer must either capture a free monomer from solution 
or must contact a monomer bound to an adjacent site on the substrate surface in order to form 
the dimeric Hg binding site. Without wishing to be bound by theory, we believe that this 
"search" process limits the efficiency of tethered fixll length MerR as compared to the single 
polypeptide chelon in capturing Hg. 

30 
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The chelons of the present invention bind a variety of transition metals with relatively 
high efficiency (affinity). In these experiments we exposed fully reduced chelon (SEQ ID NO: 
4) to a 2-fold molar excess of the indicated metals in the presence of a large excess of 
competing thiol (5 mM 2-mercaptoethanol) and then dialyzed the reactions extensively against 
5 a buffered thiol and determined the metal content by inductively coupled plasma-mass 
spectrometry (ICP-MS). The MBD binds cadmium, cobalt, copper, lead, nickel, and zinc with 
an affinity equivalent to that for Hg(II), i.e. one metal atom per protein molecule. 
Interestingly, it does not bind two other "thiophilic metals" iron and arsenite, A(III). 

10 The natural MerR protein has an additional property of interest: it undergoes a 

conformation (allosteric) change when metal ion binds to it [Heltzel et al. (1990) Biochemistry 
29, 9572-9584] . In the intact protein this change is transduced from the metal binding domain 
to the DNA binding domain and results in underwinding of the DNA to foster transcription 
H initiation. The MerR binding domain is the first functional metalloprotein subdomain which 

hi 5 is sufficiently small and stable for use outside of the cellular environment [Zeng et al. (1998) 
supra]. 



M The present inventors discovered that the chelon protein of SEQ ID NO: 4 binds the 

g same ions as the wild-type MerR protein. While the wild type MerR protein binds certain ions 

="'50 other than mercuric ions, those other ions do not induce transcription of the mer operon in vivo 
or in vitro. 

Because most proteins require that all of their subdomains fold together with each other 
in order to achieve a stable tertiary structure and avoid degradation, it was not obvious that any 
25 single subdomain of MerR could be produced as a stable functional subdomain within the cells . 
The present findings indicate that the tendency of the MerR metal binding domain to fold into 
stable, functional polypeptide structure is strong. It was also not obvious that the unnatural 
direct linkage of two of these domains would yield a stable protein, much less one which 
retained the metal binding properties of the natural MerR dimer. 

30 
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Building both halves of the metal binding domain into a single polypeptide eliminates 
the problem of achieving sufficiently high protein concentrations assure the optimum formation 
of dimers; i.e., the two halves of the binding domain don't need to "find" each other because 
they are covalently joined by a linker peptide region. Even with an affinity tag the chelon 
derivative is only slightly larger than a typical metallothionein and it will be much less costly 
to the cell to produce than the fiill length MerR protein. The stability of the folded form of the 
metal binding domain and the allosteric change which the MerR metal binding domain 
undergoes allow its use outside the cell in both metal detection and metal sequestration 
strategies. 

In metal sequestration application of the present chelon technology, the cells in which 
the chelon is expressed contain strongly bound metal ions which will not significantly exchange 
with extracellular metal ions. Due to the high affinity binding, those cells are significantly 
more resistant to the heavy metal ions than comparison cells which do not express the chelon 
protein(s). Bacterial cells engineered to express at least one chelon coding sequence are useful 
in bioremediation of contamination sites. Colonizing the intestinal tract with nonpathogenic 
nontoxigenic bacteria engineered to express at least one chelon reduces enterohepatic recycling 
of mercury and makes elimination more rapid and more efficient. Most of the ingested and 
inhaled mercury is processed through the intestinal tract of primates, and colonization with 
chelon-producing microbes reduces uptake of heavy metal ions into circulation. 

Purified chelon proteins can be used isolated away from the cellular environment. 
Present water treatment resins have relatively low affinities for toxic metals when nontoxic 
metal are in comparatively high concentration and compete for binding to the resin. A resin 
which "ignores" nontoxic metals and binds only a target metal of interest (especially mercury 
or cadmium cations) can augment existing water treatment regiments. Immobilizing a chelon 
of the present invention to a solid support allows for the design of water treatment resins useful 
in removing mercury and/or cadmium, depending on the choice of the particular chelon or a 
combination of chelons. Suitable solid supports include, without limitation, latex particles, 
glass beads, hydrogels, polystyrene and liposomes. The particles or beads can be fabricated 
from a variety of materials and can have any shape, including a spherical shape. The chelon(s) 
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can be on the surfaces of the beads or particles, or they can be within porous beads or 
particles . The beads can have added surface groups to act as spacers to improve access to the 
metal ion binding sites of the attached chelon proteins or to facilitate attachment of the chelon 
molecules, for example carboxyl groups on latex or amino modifications on polystyrene. 
5 Polymers with free amino groups that are useful in conjunction with gelling agents include, 
without limitation, alginate amine, chitosan, pectin, and polyethylene imine. Additionally, 
affixing the chelon(s) to microspheres with selective permeability allows it to access mercury 
attached to low molecular weight thiols in the environment while protecting the chelon protein 
from degradative enzymes in the environment. Magnetic versions of such microspheres can 

10 be disseminated in the soil. After a time sufficient to allow the microspheres to become 
saturated with toxic metal ions, the beads are recovered via standard magnetic recovery 

;f techniques . 



J <g^ty^ ^MerR or chelon proteins can be readily engineered to facilitate purification and/or 
15 ^^^ii^obil^ation to a solid support of choice. For example, a stretch of 6-8 histidines can be 
=^ engineered Through mutagenic polymerase chain reaction or other recombinant DNA 

^ technology to aU^ow purification of expressed recombinant protein over an nitrilotriacetic acid 

I (NTA) colunan ustog commercially available materials. Other oligopeptide "tags" which can 

f be fused to a protein of interest by such techniques include, without limitation, strep-tag 

20 (Sigma-Genosys, The\Woodlands, TX) which directs binding to streptavidin or its derivative 
strep tactin (Sigma-Genbsys); a glutathione-S-transferase gene fusion system which directs 
binding to glutathione copied to a solid support (Amersham Pharmacia Biotech, Uppsala, 
Sweden); a calmodulin-binding peptide fusion system which allows purification using a 
calmodulin resin (Stratagene^La JoUa, CA); a maltose binding protein fusion system allowing 
25 binding to an amylose resin (^ew England Biolabs, Beverly, MA); and an oligo-histidine 
fusion peptide system which allocs purification using a Ni^'^-NTA colunrn (Qiagen, Valencia, 
CA). 



The range over which the chelons of the present invention act to bind mercury (and 
30 cadmium in the case of the mercury and cadmium specific chelon) extend from as low as 10"^ 
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M and up to about 1 M. Binding of mercury and cadmium ions is effective over the pH range 
from 5 to 9, desirably from 6 to 8.5, and more preferably from 7 to 8. 



Various techniques can be used to immobilize the chelon proteins on the surfaces of a 
5 solid support, provided that metal ion binding is not impaired due to the technique used for 
protein immobilization. The protein can be coated on the surface of the beads and/or interior 
surfaces of porous beads can similarly be coated. Chelon protein can be mixed with liquid 
phase material prior to gelling and bead formation. In a specifically exemplified embodiment, 
either chelon or MerR protein is linked to Streptactin Sepharose. 

10 

With poisoning with mercury, the enterohepatic recycling of mercury slows its 
=p elimination from the body. Since colonization of humans with a recombinant strain of bacteria 

is likely to face regulatory hurdles, an alternative strategy is the use of microspheres to which 
■ I the chelon is covalently bound. Mounting the chelon protein in a microsphere with pores 

yjl 5 sufficiently large to allow diffusion of a low molecular weight thiols but sufficiently small to 
„ ' block diffusion of digestive proteases produces a robust high affinity and high specificity 

]Z nonabsorbable sequestering agent. These microspheres pass readily through the intestinal tract, 

collecting mercury on the way to elimination via natural processes. An advantage of the 
Ci chelon-containing microspheres of the present invention over prior art sulfonated amberlite 

■ ™20 resins is the increased specificity for the toxic metal over that of other divalent cations, such 

as copper or zinc, whose depletion would be deleterious to the human or animal host. 

It is well known in the biological arts that certain amino acid substitutions can be made 
in protein sequences without affecting the function of the protein. Generally, conservative 

25 amino acid substitutions or substitutions of similar amino acids are tolerated without affecting 
protein function. Similar amino acids can be those that are similar in size and/or charge 
properties, for example, aspartate and glutamate and isoleucine and valine are both pairs of 
similar amino acids. Similarity between amino acid pairs has been assessed in the art in a 
number of ways. For example, Dayhoff et al. (1978) in Atlas of Protein Sequence and 

30 Structure, Volume 5, Supplement 3, Chapter 22, pages 345-352, which is incorporated by 
reference herein, provides frequency tables for amino acid substitutions which can be employed 
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as a measure of amino acid similarity. Dayhoff et ai.'s frequency tables are based on 
comparisons of amino acid sequences for proteins having the same function from a variety of 
evolutionarily different sources. 

5 The complete nucleotide sequence encoding the MerR protein of Tn21 is available on 

GenBank, Accession No. P07044. Plasmid pASK-IBA3 is commercially available from 
Sigma-Genosys (The Woodlands, TX). The StrepTagll (trademark of Institut fiir Bioanalytik 
GmbH) technology (which depends on strep tavidin binding by particular residues, i.e., 
WSHPQFEK amino acid residues present at the C-terminus of the recombinant chelon protein; 
10 amino acids 110-117 of SEQ ID NO:4) is described in United States Patent No. 5,506,121, 
incorporated by reference herein. The recombinant mercury-binding chelon expression 
y plasmid is constructed by joining PGR amplicands of two copies of sequences encoding the 

=11 metal binding domain (residues 81-127) of MerR. To facilitate joining of these 2 metal binding 

sj domains coding sequences in a direct tandem repeat, the primers used in the amplification are 

rl 5 designed to have a common BamHI restriction endonuclease recognition site to allow the to be 
"'J joined and also to include a linker regions between them (See Fig. 4). The outer primers add 

Hi= a Bsal site to each end, allowing the entre chelon coding sequence to be cloned into the vector 

\""t 

i2 pASK-IBA3 (at the Bsal site remaining after the short Bsal fragment is removed and both Bsal 

i;f fragments were eliminated). The expression of the tagged protein is under the control of the 

h20 tetracycline-inducible tetA promoter, and the polylinker into which a coding sequence of 
interest is cloned contains restriction endonuclease recognition sites for Bsal and BsmFI. 
Crude extracts of the recombinant E, coli cells containing the tagged chelon protein are treated 
to purify the tagged protein in accordance with instructions from the manufacturer. 

25 Alternatively, controlled expression of the chelon can be achieved by inserting the 

coding sequence into pBAD (Invitrogen, Carlsbad, CA) for expression induced by arabinose. 

The nucleotide sequence encoding the chelon protein in given in Table IB. It is 
inserted into the vector for tetracycline-regulated synthesis at a Bsal restriction site. 

30 
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For mercury ion-regulated expression of the chelon coding sequence, the chelon coding 
sequence is cloned using pCC306 [Condee and Summers (1992) 7. BacterioL 174:8094-8101] . 
The luciferase coding sequences are replaced with the chelon coding sequence. Chelon 
expression is regulated by mercury in conjunction with the naturally occurring MerR protein. 
There is no need for a tag to assist in purification because recombinant host cells with the 
mercury-regulated chelon expression plasmid are to be used without protein purification, at 
least in certain applications. This vector is based on the pl5A replicon, and it is suitable for 
use in all Enterobacteriaceae. Such recombinant enteric bacteria (which are nontoxigenic and 
nonpathogenic) are suitable for use in the in vivo sequestration and elimination of mercuric ion 
from the human or animal gastrointestinal tract. 

Techniques and agents for introducing and selecting for the presence of heterologous 
DNA, i.e., a chelon coding sequences operably linked to transcription and translation 
regulatory sequences functional in plant cells, in plant cells and/or tissue are well-known. 
Constitutive transcription regulatory sequences include the Cauliflower Mosaic Virus 35S and 
19S promoters, and sequences which provide for efficient translational expression are 
described in, e.g., United States Patent No. 5,668,294 (Meagher et al.) And United States 
Patent No. 5,874,242 (Mensa-Wilmot). Where expression is preferred in above the ground 
plant parts, a rubisco promoter, for example, from soybean, can be used to drive chelon 
expression. The soybean rubisco promoter sequence is available from GenBank as Accession 
No. X58684. Genetic markers allowing for the selection of heterologous DNA in plant cells 
are readily available, e.g., genes carrying resistance to an antibiotic such as kanamycin, 
hygromycin, gentamycin, or bleomycin. The marker allows for selection of successfully 
transformed plant cells growing in the medium containing the appropriate antibiotic because 
they will carry the corresponding resistance gene. In most cases the heterologous DNA which 
is inserted into plant cells contains a gene which encodes a selectable marker such as an 
antibiotic resistance marker, but this is not mandatory. An exemplary drug resistance marker 
is the gene whose expression results in kanamycin resistance, i.e. , the chimeric gene containing 
nopaline synthetase promoter, Tn5 neomycin phosphotransferase II and nopaline synthetase 3' 
non-translated region described by Rogers et al.. Methods for Plant Molecular Biology, A. 
Weissbach and H. Weissbach, eds.. Academic Press, Inc., San Diego, CA (1988). 
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Techniques for genetically engineering plant cells and/or tissue with an expression 
cassette comprising a constitutive promoter or an inducible promoter or chimeric promoter 
fused to a heterologous coding sequence and a transcription termination sequence are to be 
introduced into the plant cell or tissue by Agrobacterium-mQdmXQd transformation, 
electroporation, microinjection, particle bombardment or other techniques known to the art. 
The expression cassette advantageously further contains a marker allowing selection of the 
heterologous DNA in the plant cell, e.g., a gene carrying resistance to an antibiotic such as 
kanamycin, hygromycin, gentamycin, or bleomycin. 

A DNA construct carrying a plant-expressible gene or other DNA of interest can be 
inserted into the genome of a plant by any suitable method. Such methods may involve, for 
example, the use of liposomes, electroporation, diffusion, particle bombardment, 
microinjection, gene gun, chemicals that increase free DNA uptake, e.g., calcium phosphate 
coprecipitation, viral vectors, and other techniques practiced in the art. Suitable plant 
transformation vectors include those derived from a Ti plasmid of Agrobacterium tumefaciens, 
such as those disclosed by Herrera-Estrella (1983), Bevan (1983), Klee (1985) and EPO 
publication 120,516 (Schilperoort et al.). In addition to plant transformation vectors derived 
from the Ti or root-inducing (Ri) plasmids of Agrobacterium, alternative methods can be used 
to insert the DNA constructs of this invention into plant cells. 



The choice of vector in which the DNA of interest is operatively linked depends 

directly, as is well known in the art, on the functional properties desired, e.g., replication, 

protein expression, and the host cell to be transformed, these being limitations inherent 

art of constructing recombinant DNA molecules. The vector desirably includes.^ . 

replicon, i.e., a DNA sequence having the ability to direct autono^*^ 7>j ^, 

/>?c/, ^ ^^t 

maintenance of the recombinant DNA molecule extra-chromoso*; ^wc/e ^ 

prokaryotic host cell, such as a bacterial host cell. Suc> "^^gr^^^ . ^^^^rja/ , 
In addition, preferred embodiments that includ'^ ^e/e^^. ^e^^^ 

whose expression confers a selective ad^- j,^ 
cell when introduced into those ^' 
those that confer resistr 



neomycin phosphotransferase gene has the advantage that it is expressed in eukaryotic as well 

as nrokarvotic cells. 

J. ^ 

Those vectors that include a prokaryotic replicon also typically include convenient 
restriction sites for insertion of a recombinant DNA molecule of the present invention. Typical 
of such vector plasmids are pUC8, pUC9, pBR322, and pBR329 available from BioRad 
Laboratories (Richmond, CA) and pPL, pK and K223 available from Pharmacia Biotech 
(Uppsala, Sweden), and pBLUESCRIPT and pBS available from Stratagene (La Jolla, CA). 
A vector of the present invention may also be a Lambda phage vector including those Lambda 
vectors described in Molecular Cloning: A Laboratory Manual, Second Edition, Maniatis et 
ah, eds.. Cold Spring Harbor Press (1989) and the Lambda ZAP vectors available from 
Stratagene (La Jolla, CA). Other exemplary vectors include pCMU [Nilsson et al. (1989) Cell 
58:707] . Other appropriate vectors may also be synthesized, according to known methods; for 
example, vectors pCMU/K^ and pCMUII used in various applications herein are modifications 
of pCMUIV (Nilson et al., supra). 

Typical expression vectors capable of expressing a recombinant nucleic acid sequence 
in plant cells and capable of directing stable integration within the host plant cell include 
vectors derived from the tumor- inducing (Ti) plasmid of Agrobacterium tumefaciens described 
by Rogers et al. (1987) Meth, in EnzymoL 15^:253-211 , and several other expression vector 
systems known to function in plants. See for example, Verma et al., No. WO87/00551; 
Cocking and Davey (1987) Science 236:1259-1262. 

A transgenic plant can be produced by any means known to the art, including but not 
limited to Agrobacterium tumefaciens-rnQdidX^A DNA transfer, preferably with a disarmed T- 
DNA vector, electroporation, direct DNA transfer, and particle bombardment (See Davey et 
al. (1989) Plant MoL Biol. 13:275; Walden and Schell (1990) Eur. J. Biochem. 192:563; 
Joersbo and Burnstedt (1991) Physiol. Plant. 81:256; Potrykus (1991) Annu. Rev. Plant 
Physiol Plant MoL Biol 42:205; Gasser and Fraley (1989) Science 244:\29?>\ Leemans (1993) 
Bio/Technology. 11:522; Beck et al. (1993) Bio/Technology. 11:1524; Koziel et al. (1993) 
Bio/Technology. 11:194; and Vasil et al. (1993) Bio /Technology . 11:1533). Techniques are 
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well-known to the art for the introduction of DNA into monocots as well as dicots, as are the 

terhninnes for rultnring such nlant tissues and repenerating those tissues. 

^-.-^^-.-J — — ^ ^ J. ^ .. .. 

For use in pseudomonads and many other Gram negative bacteria, IncQ or IncP vectors 
are particularly appropriate, and a large variety of such vectors are known in the art. For use 
in Gram positive bacteria any of a number of vectors maintained and replicated in particular 
organisms of interest, especially Bacillus subtilis and other species, are well known and readily 
available to the art. See, e.g., Pouwels, P.H. et al. (eds.) Cloning Vectors, Elsevier, 
Amsterdam, NL. Vectors suitable for use in filamentous fungi such as Aspergillus and 
Trichoderma are known as are vectors for use in yeasts including Saccharomyces cerevisiae 
and Pichia pastoris. The merOP regulatory region and the MerR coding sequence are 
desirably inserted into the vector of choice according to the desired host cell so that MerR or 
chelon expression is turned on only in the presence of mercury. 

Polyclonal and/or monoclonal antibodies capable of specifically binding to a chelon of 
the present invention are provided. The term antibody is used to refer both to a homogenous 
molecular entity, or a mixture such as a serum product made up of a plurality of different 
molecular entities. Monoclonal or polyclonal antibodies specifically reacting with a chelon of 
the present invention can be made by methods known in the art. See, e.g., Harlow and Lane 
(1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratories; Coding (1986) 
Monoclonal Antibodies: Principles and Practice, 2d ed., Academic Press, New York. Also, 
recombinant immunoglobulins may be produced by methods known in the art, including but 
not limited to, the methods described in U.S. Patent No. 4,816,567. Monoclonal antibodies 
with affinities of 10^ M \ preferably 10^ to 10^^ or more are preferred. 

Antibodies specific for the artificial metal binding proteins (chelons) of the present 
invention are useful, for example, as probes for assessing expression, for determining amounts 
of artificial binding proteins in free or bound form or for detecting the presence of the 
artificial metal binding proteins of the present invention in a test sample. Frequently, the 
polypeptides and antibodies will be labeled by joining, either covalently or noncovalently, a 
substance which provides a detectable signal. Suitable labels include but are not limited to 
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radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent agents, chemiluminescent 

agwiitkj, xiiagixvLiV' pctx Li\-.iwo cxxxta lxxv^ xxKw. ^^ixxlv^vx olulk^.^ i aii^ixLd u^K^awx xuixx^ lxxu uac kjl duuii lauQia 

include, but are not limited to, Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. 

Standard techniques for cloning, DNA isolation, amplification and purification, for 
enzymatic reactions involving DNA ligase, DNA polymerase, restriction endonucleases and 
the like, and various separation techniques are those known and commonly employed by those 
skilled in the art. A number of standard techniques are described in Sambrook et al. (1989) 
Molecular Cloning, Second Edition, Cold Spring Harbor Laboratory, Plainview, New York; 
Maniatis et al. (1982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New 
York; Wu (ed.) (1993) Meth, EnzymoL 218:Part I; Wu (ed.) (1979) Meth Enzymol, 68; Wu 
et al. (eds.) (1983) Meth. EnzymoL 100 and 101; Grossman and Moldave (eds.) Meth. 
EnzymoL 65; Miller (ed.) (1972) Experiments in Molecular Genetics, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York; Old and Primrose (1981) Principles of Gene 
Manipulation, University of California Press, Berkeley; Schleif and Wensink (1982) Practical 
Methods in Molecular Biology; Glover (ed.) (1985) DNA Cloning Vol. I and II, IRL Press, 
Oxford, UK; Hames andHiggins (eds.) (1985) Nucleic Acid Hybridization, IRL Press, Oxford, 
UK; and Setlow and HoUaender (1979) Genetic Engineering: Principles and Methods, Vols. 
1-4, Plenum Press, New York. Abbreviations and nomenclature, where employed, are deemed 
standard in the field and commonly used in professional journals such as those cited herein. 

Each reference cited in the present application is incorporated by reference 
herein to the extent that it is not inconsistent with the present Specification. 

The description provided herein is for illustrative purposes, and is not intended to limit 
the scope of the invention as claimed herein. Any variations in the exemplified articles and/or 
methods which occur to the skilled artisan are intended to fall within the scope of the present 
invention. 
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Table lA 



Amino acid sequence of wild-type MerR protein from Tn21 (SEQ ID NO: 2) 

MENNLENLT I GVFAKAAGVNVET I RF YQRKGLLRE PDKP YGS I RRYGE AD WRV 
KFVKSAQRLGFSLDEIAELLRLDDGTHCEEASSLAEHKLKDVREKMADLARMET 
VLSELVCACHARKGNVSCPLIASLQGEAGLARSAMP 



Table IB 

Nucleotide sequence encoding the wild-type MerR protein from Tn21 (SEQ ID NO: 1) 



1 


ATGGAAAATA 


ATTTGGAAAA 


CCTGACCATT 


GGCGTTTTTG 


CCAAGGCGGC 


51 


CGGGGTCAAC 


GTGGAGACAA 


TCCGCTTCTA 


TCAGCGCAAG 


GGCCTGTTGC 


101 


GGGAACCGGA 


CAAGCCTTAC 


GGCAGCATCC 


GCCGCTATGG 


GGAGGCGGAC 


151 


GTGGTTCGGG 


TGAAATTCGT 


GAAATCGGCA 


CAGCGGCTGG 


GGTTCAGTCT 


201 


GGACGAGATT 


GCCGAGCTGT 


TGCGGCTCGA 


CGATGGCACC 


CACTGCGAGG 


251 


AGGCCAGCAG 


CCTGGCCGAA 


CACAAGCTCA 


AGGACGTGCG 


CGAGAAGATG 


301 


GCCGACTTGG 


CGCGCATGGA 


AACCGTGCTG 


TCTGAACTCG 


TGTGCGCCTG 


351 


CCATGCACGA 


AAGGGGAATG 


TTTCCTGCCC 


GTTGATCGCG 


TCACTACAGG 



4 01 GCGAAGCAGG CCTGGCAAGG TCAGCTATGC CTTAG 



Table IC 

Complete amino acid sequence of the chelon protein (SEQ ID NO:4)showing novel residues 
not found in wild-type MerR, The last underlined residues are derived from the StrepTag 
vector (Genosys) and are not essential to the metal-binding domain nor do they interfere with 
metal binding (SEQ ID NO: 3). These residues are only important for purification of the 
protein. 

MTHCEEASSLAEHKLKDVREKM7VDLARMETVLSELVCACHARKGNVSCPLIASLQGSS 
GTHCEEASSIiAEHKLKDWEKMADIARMETVLSELVCACHARKGNVSCPSA^ 
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Table ID 



Complete 321 nucleotide DNA sequence encoding the essential 107 residues of the Chelon 
5 protein (SEQ ID NO: 3). This DNA sequence does not include the sequence encoding the 
StrepTag portion of the protein. The DNA sequence for the StrepTag is incorporated in the 
Sigma-Genosys vector. Any other affinity tag or none at all (i.e. the normal wildtype sequence 
of the protein) could be incorporated for use in alternative protein purification schemes. 

10 ATGACACACTGCGAGGAGGCCAGCAGCCTGGCCGAACACAAGCTCAAGGACGTGCGCGAGAA 
GATGGCCGACTTGGCGCGCATGGAAACCGTGCTGTCTGAACTCGTGTGCGCCTGCCATGCAC 
GAAAGGGGAATGTTTCCTGCCCGTTGATCGCGTCACTACAGGGATCCTCAGGCACCCACTGC 
GAGGAGGCCAGCAGCCTGGCCGAACACAAGCTCAAGGACGTGCGCGAGAAGATGGCCGACTT 
GGCGCGCATGGAAACCGTGCTGTCTGAACTCGTGTGCGCCTGCCATGCACGAAAGGGGAATG 

15 TTTCCTGCCCG 
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Table 2 



Amino acid sequences of chelons which bind cadmium as well as mercury ions with high 
affinity. The residues which differ from the mercury specific chelon (see Table IB) are shown 
5 with bold and underlining. 

MTH C E E VS S L AE H KL KD VRE KMAD L ARME T VL S E L VC ACHARKGNVS C P L I AS LQGSS 
GTHCEEVSSLAEHKLKDVREKMADLARMETVLSELVCACHARKGNVSCP SAWSHPQFEK 
(SEQ ID NO: 5) 

10 

MTHCEEASSLVEHKLKDVREKTMADLARMETVLSELVCACHARKGNVSCPLIASLQGSS 
GTHCEEASSLVEHKLKPyREKMADLARMETVLSELVCACHARKGNVSCP SAWSHPOFEK 

(SEQ ID NO: 6) 

15 

MTHCEEASSLAEHKLKDVRETMADLARMETVLSELVCACHARKGNVSCPLIASLQGSS 

GTH C E E AS S L AE H KL KD VRE TMAD L ARME T VL S E LVC ACHARKGNVS C P SAWSHPOFEK 
Q (SEQ ID NO: 7) 

MTHCEEASSLAEHKLKDVREQMADLARMETVLSELVCACHARKGNVSCPLIASLQGSS 
.''^ GTHCF.FIASSLAEHKLKDVREOMADLARMETVLSELVCACHARKGNVSCP SAWSHPQFEK 
n (SEQ ID NO: 8) 

- j; - 

MTHCEEASSLAEHKLKDVREKMADLARVETVLSELVCACHARKGNVSCPLIASLQGSS 
S rtTT^rFF.A.q<^T.AF.HKT.KDVREKMADLARyETVLSELVCACHARKGNVSCP SAWSHPOFEK 
U (SEQ ID NO: 9) 

U MTHCEEASSLAEHKLKDVREKMADLARIETVLSELVCACHARKGNVSCPLIASLQGSS 
GTHCEEASSLAEHKLKDVREKMADLARIETVLSELVCACHARKGNVSCPSAWSHP^^F^ 
(SEQ ID NO: 10) 

35 

MTHCEEASSLAEHKLKDVREKMADLARMETVLSELVCACHARKGNVPCPLIASLQGSS 
nTTTPT^P.A.c-.qT.AF.TTKT.KnVPF.KMADT.ARMFTVLSELVCACHARKGNVPCP SAWSHPOFEK 
(SEQ ID NO:ll) 

40 

MTHCEEASSLAEHKLKDVREKMADLARMETVLSELVCACHARKGNVSCPLIALLQGSS 
GTHCEEASSLAEHKLKDVREKMADLARMETVLSELVCACHARKGNVSCPSAWSHP^ 
(SEQ ID NO: 12) 
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Table 3 

Hg-binding by Resin-tethered MerR and Chelon (SEQ ID NO:4) 



Protein 


moles Hg bound/mole protein 


chelon 


0.490 


MerR 


0.194 



10 



Table 4 

Metal Binding by Chelon (SEQ ID NO:4) 



45 



c:eo 



Metal 


moles bound/mole protein 


arsenic 


below limit of detection 


cadmium 


0.66 


cobalt 


0.79 


copper 


0.95 


iron 


below limit of detection 


lead 


0.93 


nickel 


0.75 


zinc 


0.38 



25 
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Table 5 A. 



Primers for construction of pASK-MBD: 

5 Product 1 : 

Forward: 5' TGCGGCGGTCTCAAATGACACACTGCGAGGAGG 3' (SEQ ID NO: 13) 
Reverse : 5 ' GCCTGAGG ATCCCTGTAGTGACGCG ATC A ACGG 3 ' (SEQ ID NO : 1 4) 

Product 2 : 

10 Forward: 5' CTACAGGGATCCTCAGGCACCCACTGCGAG 3' (SEQ ID NO: 15) 

Reverse : 5 ' CTGT AGGGTCTCGGCGCTCGGGC AGGAAAC ATT 3 ' (SEQ ID NO : 1 6) 

Table 5B. 

1 5 Sequence of pASK-MBD gene (SEQ ID NO: 17) 

3 ATGACACACTGCGAGGAGGCCAGCAGCCTGGCCGAACACAAGCTCAAGGACGT 

D GCGCGAGAAGATGGCCGACTTGGCGCGCATGGAAACCGTGCTGTCTGAACTCGT 

3 GTGCGCCTGCCATGCACGAAAGGGGAATGTTTCCTGCCCGTTGATCGCGTCACT 

io AC AGGGATCCTC AGGC ACCC ACTGCGAGGAGGCC AGC AGCCTGGCCG AAC AC A 

2 AGCTCAAGGACGTGCGCGAGAAGATGGCCGACTTGGCGCGCATGGAAACCGTG 

[j CTGTCTGAACTCGTGTGCGCCTGCCATGCACGAAAGGGGAATGTTTCCTGCCCG 

5 AGCGCTTGGAGCCACCCGCAGTTCGAAAAATAA 

H25 



24 



Table 5C. 



Adjacent plasmid regions with enzyme cutting sites labeled (SEQ ID NO: 18) 



CCRTCGnnTGGCCRGflTGnTTnnTTCCTnfiTTTTTGTTGflCRCTCTnTCRTTGnTnGRGTTnTTTTfiCCflCTCCCTflTCRGTGflTRGRGflRRflGTGflflRTGR 

Xbai BstXi 

RTRGTTCGRCRflflRflTCTflGflTflRCGRGGGCRRRflRRTGRCflCRCTGCGflGGRGGCCRGCflGCCTGGCCGRRCRCfiRGCTCRRGGflCGTGCGCGflGRRGflTG 

► Me tThrH i sCysGluG I u A I aSerSerLeuA 1 aG I uH i sLysLeuLys AspVa I ArgG I uLysMe \ 

BssHil 

GCCGRCTTGGCGCGCRTGGRRRCCGTGCTGTCTGRRCTCGTGTGCGCCTGCCRTGCflCGRRRGGGGRRTGTTTCCTGCCCGTTGRTCGCGTCRCTRCRGGGR 

► a I a Asp Leu A I aArgMe tG I uThrVa I LeuSerC I uLeuVa I CysA I aCysH t s A I aArgLysG I yAsnVa I SerCysProLeu I I eA I aSerLeuG I nG I y 

BstXi BssHII 

TCCTCRGGCRCCCRCTGCGRGGRGGCCflGCRGCCTGGCCGRRCRCRRGCTCRRGGflCGTGCGCGRGRRGRTGGCCGRCTTGGCGCGCRTGGRflfiCCGTGCTG 

► SerSerGlyThrHi sCysGI uGl uAl aSerSerLeuA laG luH i sLysLeuLys AspVa I ArgGl uLysMe tAlaAspLeuAl aArgMe tG I uThrVa t Leu 

Hindm 



► SerGluLeuVa I CysA I aCysH i sA I aArgLysG I yAsnVa I SerCysProSer A I aTrpSerH i sProG I nPheGluLys 

Lighter print: Chelon coding region; followed in regular print by sequence encoding 10 
amino acid-tail (Strep-tagll) at the C terminus of the MBD. 
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